Optimal estimation of group transformations using entanglement 
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We derive the optimal input states and the optimal quantum measurements for estimating the 
unitary action of a given symmetry group, showing how the optimal performance is obtained with 
a suitable use of entanglement. Optimality is defined in a Bayesian sense, as minimization of the 
average value of a given cost function. We introduce a class of cost functions that generalizes the 
Holevo class for phase estimation, and show that for states of the optimal form all functions in 
such a class lead to the same optimal measurement. A first application of the main result is the 
complete proof of the optimal efficiency in the transmission of a Cartesian reference frame. As a 
second application, we derive the optimal estimation of a completely unknown two-qubit maximally 
entangled state, provided that N copies of the state are available. In the limit of large N, the fidelity 
of the optimal estimation is shown to be 1 — 3/(4iV). 

PACS numbers: 03.65.Ta, 03.67.-a 



I. INTRODUCTION 

Many of the most surprising advantages offered by the 
new technology of Quantum Information [Q arise from 
the concept of quantum entanglement. Computational 
speed-up || |3) , quantum teleportation Q and dense cod- 
ing secure protocols in cryptography J S] , precision 
enhancement in quantum measurements J7, |) are just 
a short summary of some of the main lines of research 
inspired by entanglement. 

After a so promising list, it is natural to expect re- 
markable improvements coming from entanglement also 
in the context of Quantum Estimation Theory |9|, [To|| , 
in particular in the typical problem of estimating an un- 
known physical transformation drawn from a given set. 
With a heuristic argument inspired by dense coding, we 
could expect that the accuracy in the discrimination of a 
set of quantum channels can be increased by letting them 
act locally on a fixed side of a maximally entangled state. 
Even more, one is tempted to guess that a maximally en- 
tangled state is the optimal input for the estimation of 
an unknown black box. Even though these are both rea- 
sonable conjectures, in general they turn out to be false: 
for example, a maximally entangled input state is always 
useless — and often suboptimal — for the discrimination of 
two unitary transformations |llj. The question then 
arises: is it really possible to make some general state- 
ment about the role of entanglement in the optimal esti- 
mation of an unknown transformation? 

In this paper we will answer this question in the co- 
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variant case, which corresponds to the estimation of uni- 
tary transformations randomly picked out from a given 
representation of some group. To face the problem, we 
will choose the Bayesian approach, assuming a uniform 
a priori distribution for the unknown group parameters, 
and defining optimality as the minimization of the aver- 
age value of a given cost function. Within the Bayesian 
framework, some results about the optimality of max- 
imally entangled states have been presented in |t], fl2| . 
Other results in the same direction have been derived 
in [fl3[ |l4|| within a different approach based on quan- 
tum Cramer-Rao bound. However, all the mentioned 
results are limited to particular cases, and their exten- 
sion to arbitrary representations of arbitrary groups is 
not straightforward. 

Another nontrivial question is: which kind of entangle- 
ment is really useful for the estimation of group transfor- 
mations? In Ref . |Q , it has been considered the estima- 
tion of unitary transformations U g in §U(d) in the form 
U® , corresponding to N copies of the same unknown 
black box. The result is that the optimal performance 
can be attained by entangling the N d-level systems that 
undergo the unknown transformation with another set of 
N (i-level systems playing the role of a reference system. 
However, as pointed out in Ref. jl5|], the entanglement 
with an additional set of N reference systems actually is 
not needed: what really matters is something more sub- 
tle, namely the entanglement between spaces where the 
action of the group is irreducible and spaces where the 
action of the group is trivial. In the language of group 
theory, what is needed is maximal entanglement between 
representation spaces and multiplicity spaces. This kind 
of entanglement can be obtained not only by adding an 
external reference system as in , but also via the use 
of the multiple equivalent representations that appear in 
the Clebsch-Gordan decomposition of the representation 

The concept of entanglement between representation 
spaces and multiplicity spaces will be the protagonist 
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of this paper. In the following, we derive the optimal 
scheme for estimating an unknown group transformation, 
showing how this kind of entanglement allows to achieve 
the ultimate precision limits allowed by Quantum Me- 
chanics. To do this, we introduce a class of cost functions 
that generalize the well known Holevo class for phase es- 
timation , and show that all functions in such a class 
lead to the same optimal measurement. We give also an 
explicit expression for the average cost so that the opti- 
mization of the estimation scheme is reduced to a simple 
eigenvalue problem. 

In Sec. |0], before starting the analysis about optimal 
estimation strategies, we introduc e th e notation (II A) 
and some group theoretical tools ( [IB ) that will be ex- 
ploited throughout the paper. In Sec. |lj, we present the 
proble m of estimating an unknown group transformation 
( III A ), introducing a generalization to ar bitrary groups 
of the Holevo class of cost functions (III B). The optimal 
input states are then derived ( III C ), and the entangle- 
ment between representation and multiplicity spaces is 
recognized to be the basic resource for an optimal estima- 
tion strategy. In order to find the optimal measurement 
for the estimation of a group transformation, we show 
in Sec. Ill D how the special form of the optimal input 



states reflects on the covariance properties of the opti- 
m al mea surement. Exploiting this analysis, we will show 
in HIE that, for input states of the optimal form, all 
cost functions in the generalized Holevo class lead to the 
same optimal measurement. Finally, Sec. IV is devoted 



to applications of the general results. A first application 
(IV A) is the optimality proof of the protocol [|l5| for the 
absolute alignment of two reference frames. As a sec- 
ond application, we derive (IVB) the optimal estimation 
of a completely unknown two-qubit maximally entangled 
state with N identical copies of the state. Section V 
concludes the paper, while the most technical proofs are 
provided in the Appendix. 



II. THEORETICAL TOOLS 

A. Notation for bipartite states 

A simple notation can be introduced to deal with bi- 
partite states. Given two Hilbert spaces Ha and Hb , and 
fixed two orthonormal bases Ba = {\4>n) \ n = E ■ ■ • , Aa\ 
and Bb = {\ipn} | n = 1, . . . , ds} for Ha and Hb respec- 
tively, it is possible to associate in a one to one way any 
vector \C)) £ Ha®Hb with an operator C £ B(Hb,Ha) 
via the relation |l6| 

\C))=J2 (<t>™m n ) \4>m)\^n) ■ (1) 
m,n 

With this notation, one has the simple relations 

({C\D)) = Tr[&D] (2) 

and 

A®B \C)) = \ACB T )) , (3) 



for any A £ B(Ha) and B £ B(Hb), where transposition 
T is defined with respect to the fixed bases. Such rela- 
tions allow to greatly simplify the calculation involving 
entangled states, and will be extensively used throughout 
the paper. 



B. Elements of group theory 

Here we recall some simple tools of group theory |l7j] 
that will be exploited throughout the paper. 

Suppose we are given a Hilbert space H and a unitary 
representation R(G) = {U g £ B(H) \ g £ G} of a com- 
pact Lie group G. The Hilbert space can be decomposed 
into orthogonal subspaces in the following way 



H 



(4) 



where the sum runs over the set of irreducible represen- 
tations of G that appear in the Clebsch-Gordan decom- 
position of R(G). The action of the group is irreducible 
in each representation space H^, while it is trivial in the 
multiplicity space C mM , namely 



U, 



= U£®1 T . 



(5) 



Id denoting the identity in a d— dimensional Hilbert 
space. The projection n M onto the subspace H^ ® C™ 1 ' 1 
is given by the integral formula 



n„ 



d 9X^{9) u g 



(6) 



where d g denotes the normalized invariant Haar mea- 
sure (dg — d(kg) = d(gk) for any k,g £ G), = 
dim(H M ), and x M (g) = Tr[Ujf] is the character of the 
irreducible representation \x. Note that here we are con- 
sidering G as a continuous group only for fixing notation, 
nevertheless — here and all throughout the paper — G can 
have a finite number of elements, say |G|, and in this 
case we have simply to replace integrals with sums and 
dg with 1/|G|. 

Moreover, any operator O £ B(H) in the commutant 



of R(G)- 
form 



-i.e. such that [O, U g ] — \/g £ G — has the 



O 



V, 

pes 



(7) 



where is a x m M complex matrix. In particular, the 
group average (^4)g = / dg U g AU^ of a given operator 
A with respect to the invariant Haar measure is in the 
commutant of R(G), and has the form: 



(8) 
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where Tr-n [A] is a short notation for Tr-n [II M A H^] , 11^ 
being the projection onto 7i M <g> C" 1 ^ . Here and through- 
out the paper we assume the normalization of the Haar 
measure: J G dg = 1. 

Remark I: entanglement between representation 
spaces and multiplicity spaces. 

The choice of an orthonormal basis 
B M = £ C m » | n = l,...,m M } for a 

multiplicity space fixes a particular decomposition of the 
Hilbert space as a direct sum of irreducible subspaces: 

® c m * = C=i H£ , (9) 

where = 7i M <g)|<^). In this picture, it is clear that 
is the number of different irreducible subspaces carrying 
the same representation /i, each of them having dimen- 
sion d^. Moreover, with respect to the decomposition 
(||), any pure state \ty) € H can be written as 

l*> = <V , (10) 

where is a bipartite state in TC^ €5 C m f and 

J2fi.es I c mI 2 = 1- With respect to the direct sum decom- 
position the Schmidt number of such a state is the 
minimum number of subspaces carrying the same repre- 
sentation p £ S that are needed to decompose l^). 

Remark II: maximum number of equivalent repre- 
sentations in the decomposition of a pure state. 
The Schmidt number of any bipartite state 
£ Ti-fj. ® C m " is always less then or equal 
to kfj_ = min{rf M , m M }. This means that any pure state 
can be decomposed using no more than k^ irreducible 
subspaces carrying the same representation p £ 5. 



III. OPTIMAL ESTIMATION OF GROUP 
TRANSFORMATIONS 

A. Background problem 



Our task is to find the best input states and the best 
estimation strategies allowed by Quantum Mechanics to 
determine the parameter g. Since we are interested in ul- 
timate in-principlc limits, we assume complete freedom in 
preparing any physical state and in realizing any quan- 
tum measurement. This means that we are allowed to 
choose the state p sn with minimal stability group, re- 
ducing the set of unitaries that are not discriminable to 
those that differ just by a phase factor. Therefore, the 
stability group can be only a (nontrivial) center for G, 
made of multiples of the identity, corresponding to (a 
subgroup of) U(l). The quotient group is then a group 
itself, and in the following we will use the same symbol G 
for such a quotient group. Notice that the requirement of 
central stability group U(l) is satisfied by choosing the 
state p as pure, and with maximal Schmidt number. 

The most general estimating strategy allowed by quan- 
tum mechanics, including both quantum measurements 
and classical data processing, can be described by a Pos- 
itive Operator Valued Measure (POVM) M that asso- 
ciates to any estimate g £ G a positive semidefinite op- 
erator M(g), satisfying the normalization condition 

f dg M(g) = 1 . (13) 
Jg 

The probability density of the estimate g in the state p g 
is given by the usual Born rule: 

p(g\g) = Tr[ Pg M(g)] . (14) 

In this paper, the estimation problem will be faced 
in the Bayesian setting with prior uniform probability 
density dg, and the optimal estimation will be defined 
as the one that minimizes the average value of a given 
cost function c(g,g) that associates to any estimate g a 
cost which increases versus the "distance" of g from the 
true value g. The average of the cost function over the 
prior and the conditional probability distributions will be 
given by 

(c) = [dg [ dgc{g,g)p{g\g) . (15) 



Suppose we are given a black box that performs on 
a system S an unknown unitary transformation U g ran- 
domly drawn from a group representation R(G). In order 
to estimate the transformation U g , we can prepare the 
system in an input state p s , send it through the black 
box, and try to estimate the parameter g from the output 
state 

P S g EE U gP S Ul . (11) 

More generally, we can also exploit an additional refer- 
ence system 1Z and prepare an entangled state p sn , so 
that the output state becomes 

p s g n ee (U g ® l n ) P sn (Ul ® In) ■ (12) 



B. A generalized Holevo class of cost functions 

We will make two assumptions on the form of the cost 
function c(g, g). 

First assumption. We require c to be group invariant, 
namely 

c(g, g) = c(kg, kg) V.g, g, k £ G (16) 

(left-invariance), and 

c(g,g) = c(gk,gk) Vg,g,k£G (17) 

(right-invariance) . By using Fourier analysis, one can 
prove (see Appendix) that this assumption is equivalent 
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to the expansion 



(18) 



where x a (g) = ^[[/^(g)] is the character of the irre- 
ducible representation a, and the coefhcients a a satisfy 
the identity a* = a„* Ver, in order to have a real cost 
function. 

Second assumption. We require all nonzero coefficients 
a a in the expression dl§| ) to be negative, with the only 
exception of the coefficient a aa corresponding to the triv- 
ial representation U a °(g) = 1 Vg, which is allowed to be 
positive (the cto term just adds a trivial constant to the 
cost function, since X CTo Q?) = 1 Vp). 

The class of functions that satisfy our two assumptions 
is a direct generalization of the class of cost functions 
introduced by Holevo for the estimation of phase shifts 
[IToll - In fact, such functions have the form 



E 

fcez 



a k e 



— ik(<fi — (j 



(19) 



where < for any k ^ 0, and e %k ^ is the character of 
the unidimcnsional representation labeled by k. 



C. Optimal choice of the input state 

Since the average cost ( |l5| ) is a linear functional of the 
input state p, in the optimization problem we can re- 
strict attention to pure input states p = |\E')(\I/|. Then 
the problem becomes equivalent to the optimal discrim- 
ination problem of states in the orbit 



(D = {\* g )^U g |*) |.geG} 



(20) 



generated from |\&) by the action of the representation 
R(G). 

Let's consider the Clebsch-Gordan decomposition (g) 
of the unitaries U g . From now on we will assume the 
algebraic condition 



Lemma 1 The assumption 
loss of generality. 



A) can be done without any 



Proof. Suppose > for some representation /i. In 
this case, we can introduce a reference system 1Z whose 
dimension is 



dn > max 



(22) 



the other hand, as already mentioned at the end of Sec. 
P B| , any pure state can be decomposed in the form 
(|10|) with no more than fc M = minjoJ^, m M } irreducible 
subspaces for any fi. Therefore, we can switch our atten- 
tion from the whole Hilbert space Tt (8 Tin = W M <8> 

C"V to the invariant subspace TC = M H^®C d ", 
which contains the input state 1^) along with its orbit 
(p0|). In other words, without loss of generality we can 
always consider an input state in the Hilbert space 



W = H„ 



(23) 



which can be thought as embedded in a larger Hilbert 
space TL <g> TL-r- B 

Remark. The need of adding an external reference 
system 1Z arises only in the case when d^ > m M for 
some irreducible representation /i. In fact, the role of 
the reference system is simply to increase the number 
of equivalent representations until the extended Hilbert 
space TL £g> TL-r, reaches the threshold to m > rf M V/i. This 
observation allows to greatly reduce the dimension of the 
reference system with respect to the customary estima- 
tion schemes inspired by dense coding, with a reference 
system TL-r having the same dimension of TL. 

Now we show that the best input state |\E') for estimat- 
ing the group transformation of an unknown black box 
is a state of the form (|To|) , with each l^)) maximally 
entangled, namely 



1 



n=l 



\<Wu) 



(24) 



n = l,...,d p } and % = {|#J> 



k = m) 

1, . . . , dp,} being Schmidt bases for TL^ and C dfI respec- 
tively. Exploiting the notation (Q) — with fixed bases 
and Bg — the optimal input state \^f) must have the form 



l*>=© 



\w»)) 



(25) 



(21) with Wf, = Y. n l^n)(^nl unitary operators. 



Theorem 1 (optimal input states) With a suitable 
choice of the coefficients {c M } ; any input state of the form 
achieves the minimum average cost. 



Suppose that the minimum cost 



\Opt 



is achieved by the 



input state |$) = ® c M |$ A 



strategy described by the POVM M(g) 

T 



along with the estimation 
The operator 



K, 



i, 



converts the orbit of 



an input state (25) into the orbit of the optimal input 
state | < I ) ) ) since using identity (|), we have 



K h |* fl 



I* 



and replace U g with its extension U' g = U g ®~t-R, acting in 
the tensor product Hilbert space TL®TL-r. In this way, U' wh ere |* 9 ) = U g \9) and |$ fl 



gh ) 



(26) 

Consider now 



will satisfy the condition m' 



xd n >d„ V>. On the POVM M'{g) = J ' dh K\ M(gh) K h . The POVM 
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M'(g) is normalized, since 

J dg M'{g) = JdgJ AhK{ M{gh) K h 

= Jdh K\K h 



where we exchanged integrals over g and h, used invari- 
ance of the Haar measure dg, and finally used Eq. (||) 
and the normalization of bipartite states \&u)) hi the form 
Tr[$t = 1. A state |W) of the form @ along with 
the POVM M'(g) achieves the minimum cost. In fact, 



where 



(c) = J dg J dgc(g,g) <* 9 | M\g) \9 g ) 

= Jdgjdgjdh c(g,g) (§ gh \ M(gh) \$ gh ) 

dgjdgj dh c{gh,gh)(<S> gh \ M(gh) \$ gh ) 

dk Jdk c{k,k) ($ fc | M{k) |$ fc > 
= (c)° pt , 

where we used right-invariance of both cost function and 
Haar measure. ■ 



D. Covariance properties of the estimating POVM 

Since the whole orbit ( p0| ) is generated from the input 
state |W) by the action R(G) of the group, there is no 
loss of generality in restricting attention to estimating 
POVM of the covariant form @ 

M fa) = U g 5 Ul (27) 

with 5 a suitable positive operator satisfying the nor- 
malization condition (|l3|). A covariant POVM yields a 
left-invariant probability distribution, namely p(kg\kg) — 
p(<?l<?) ^k,g,g G G. Using both the left-invariance of 
the probability distribution and of the cost function, the 
average cost ( |l5| ) can be written as 



dg c(g,e) p(g\e) 



(28) 



where e is the identity element of the group G. 

For superpositions of maximally entangled states as 
in Eq. (pq), the orbit O enjoys an additional symme- 
try that reflects on an additional covariance property of 
the POVM. In fact, using the decomposition (||) and the 
identity (||), we can note that 



= U g |*) 

= e- 

^es V 

-e J 



(U^l^) \W M )) 



V<? e G 



\Wu 



(29) 



is an element of a new unitary representation R'(G) of the 
group G. Notice that the two representations R(G) and 
R'(G) commute among themselves. Then, the following 
Lemma holds: 

Lemma 2 There is no loss of generality in assuming a 
covariant POVMM(g) = U g E W g with 



[S, U g V B ] = Vg G G 



(30) 



where U g and V g are given in Eqs. ^ and respec- 
tively. 

Proof. For any possible POVM N(g) there is a covari- 
ant POVM with the above property and with the same 
average cost. In fact, the group average 

M(g) = Jdk Jdh U\vl N{kghr l ) V h U k (31) 

is covariant — namely M(g) = U g 5 £7J with S = M(e)— 
and satisfies the required commutation relation (pQ). 
Both properties follow simply from the invariance of the 
Haar measure. To prove that the cost of the covariant 
POVM M(g) is the same as the cost of N(g) we use the 
property 

U k V h = \V kgh -i) Vk,h,g G G (32) 
of the states generated from the input (|2^) . In this way, 



(c)m 



dg j dgc(g,g) M(g) |* g ) 

= J dg J d ® J dk J dflC ^' 9 ^ X 

x(*k 9 h-A Nikgh- 1 ) \V kgh -x) 

= J dg J dg J dk J dh c(kgh^ 1 ,kghr x ) x 

A^kgh-A Nikgh- 1 ) \9 kgh -i) 

df c(f,r) (* r | N(f) |* r ) 



d r 

= (c)n , 

where we used the left- and right-invariance of the cost 
function c(g, g). I 

Let's diagonalize the operator S and express its (non- 
normalized) eigenvectors in the decomposition (||) : 

r 
i=l 

= E v^liMvil , (33) 

i ^i,u 

where the factor ^fol^ has been inserted just for later 
convenience. 
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Lemma 3 Any covariant POVM M(g) — U g E f/J with 
the commutation property must satisfy the two rela- 
tions: 



E i + i = *» v ^ e s . 



E « = ^ V M e S . 



Proof. The normalization CfL 31) becomes 



(34) 



(35) 



(S) G = JdgU g EUl = l 



(36) 



The group average (E)g can be expressed using Eq. (^). 
In this way, Eq. ( p6| ) becomes 

±Tr nii [Z] = l lx V/iGS. (37) 

By explicit computation, 

^Tr^[~] = E Tr^[|<))<«|] 

= E < Tr^[|l M »((l M |] r?» 

EiT i* 
% % ■ 

i 

Substituting this expression in ( |37| ) and taking the com- 
plex conjugate we get (|34|) . Moreover, using the commu- 
tation relation (30), we can transform the group average 
with respect to R(G) in a group average with respect to 
R'(G), namely 

<5)g = Jdg U g (U^ E U g V g ) U\ 
dg Vl E V g . 



In this way, using Eq. (|29|) , Eq. (|35|) can be proved along 
the same lines used to prove Eq. (p3). ■ 



E. The optimal POVM 

We are now able to find the optimal covariant POVM 
for the estimation of group transformation with superpo- 
sitions of maximally entangled states. 

Theorem 2 (optimal POVM) In the estimation of 
the states in the orbit O generated from the input state 



i*>=e -} 



\Wa 



(38) 



where are unitary operators, the covariant POVM 
given byE — \rj){rj\ with 



l»?>=© Vd~e iai ^ \W,)) 



(39) 



is optimal for any cost function c(g, g) of the form 

c(£, 5 )=E <VX CT *(&r 1 ) , (40) 

a 

with a„ < Vcr ^= o~o . 

The average cost corresponding to the optimal estimation 
strategy is 



Opt 



E ! c mI c ^ m 



where 



= E Ocr 



(41) 



(42) 



iria being the multiplicity of the irreducible representa- 
tion a in the Clebsch- Gordan series of the tensor product 



Proof. We will show that Eq. (|l]) gives a lower bound 
for the average cost, and that the POVM 3 = 
with 1 77) given by Eq. ( |39| ) achieves this bound. By using 
identitie s (B ) and (g), and the form (|3^) for the operator 
S, Eq. (g8|) becomes 



(c) = / dg c{g,e) x 

x E E c ^ Tr M u s ® ^ T • 

Let's expand c(<?, e) as in (f4(i|). Subtracting from the av- 
erage cost (c) the constant term a ao , which is not relevant 
for the optimization, we get 

(c) - a ao = E E c l c » x 

x E ? Tr (vlwl^ni* wj) , 

where we defined 

LT^ = d„ Jdg X a * (g) U£ ® C/;* . (43) 

According to (||), is the projection onto the direct 

sum of all the subspaces of TC^ ® H v that carry the irre- 
ducible representation a in the tensor product U£ (8 C/g*. 

Clearly n^"' is nonzero if and only if the Clebsch-Gordan 
series of U[f ® t/^ contains tr with nonzero multiplicity 

m^. Notice also that TrpI^] = d a rn£ v) , by defini- 



tion of n 
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Denoting by ^ v a the sum over fi, v and all a except 
<To, the average cost can be bounded as follows 



(c) - a ao > J- 



> 



\ 




since all a CT are nonpositive. The second inequality fol- 
lows from Cauchy-Schwartz inequality with respect to the 



A]B„ 



where we take 



scalar product (A, B) = Tr 

A\ = H ( r\vlWl <g> t v ) and Bi = (K ® C^J) 
Exploiting the relations and (pa), and using that 



Ti- 



ll 



d a m^ y \ we obtain the bound 



(c) > a an + 



(44) 



It is straightforward to see that the choice of a covariant 
POVM with S = \rf){r]\ with I77) given by ( p9| ) achieves 
this lower bound. ■ 



F. Remarks 

Remark I Up to the constant term a aa , the minimum 
cost (^) is simply given by the expectation value of the 
cost matrix (E4) over the normalized vector v = ( |c M | ). 
Therefore the optimal input state is obtained just by 
finding the eigenvector corresponding to the minimum 
eigenvalue of the cost matrix. In other words, the op- 
timal state for the estimation of an unknown parameter 
is always a superposition of maximally entangled states, 
with the coefficients in the superposition modulated by 
the particular choice of the cost function. Notice the 
simplification of the optimization problem provided by 
Theorem ||| instead of optimizing a state in the Hilbert 
space 7i = ®, l£ s ® ^ m>1 we nee d only to optimize 
a vector in K' s ', where |S| is the number of irreducible 
representations contained in the action of the black box. 

Remark II The optimal POVM of Theorem | is the 
same optimal POVM arising from the maximum likeli- 
hood criterion In fact, this criterion corresponds 
to the particular choice of the delta cost function 

c(g,g) = -5{g,g) 

= -E d -x a {gg~ l ) , 

which is of the form (f4(i|). In other words, in the case of 
superpositions of maximally entangled states, the result 



of Theorem [| can be viewed as the extension of the max- 
imum likelihood approach of Ref. [fil| to arbitrary cost 
functions. 

Remark III In the optimization of covariant POVM's 
it is often assumed that the operator S corresponding 
to an optimal estimation can be taken with unit rank. 
However, for mixed states some counterexamples are 
known |2G, 21 1, and for pure states there is no general 



proof that the POVM minimizing the average Bayes 
cost can be chosen with rank one. Therefore, it is 
important to emphasize that here the rank-one property 
of the optimal POVM of Theorem || is a result of the 
derivation, not an assumption. 



IV. APPLICATIONS 
A. Optimal transmission of reference frames 

The result of Theorem ^ can be exploited to give the 
definitive proof of optimality of the protocol for the abso- 
lute transmission of a Cartesian reference frame of Ref. 
filH , which concludes a long debate about the optimal 
way of communicating a reference frame p2| . Such a pro- 
tocol allows two distant parties, Alice and Bob, to align 
their Cartesian axes in an absolute way, i.e. without the 
need of any kind of prior information about their rela- 
tive orientations. To this purpose, Alice sends to Bob N 
spin-i particles, prepared in some fixed state. The prepa- 
ration procedure of the state is related to the directions 
of Alice's Cartesian axes: for example Alice can align 
the angular momenta of some particles with her x— axis, 
some with her y— axis, and so on. When Bob receives 
the particles, since his axes are mismatched with Alice's 
ones, each particle appears rotated by the same unknown 
rotation. Then, instead of receiving the particles in the 
same state prepared by Alice, Bob receives them in a ro- 
tated state. Clearly, if he knows how the state should 
look in absence of rotations, he can try to estimate the 
difference, i.e. he can estimate the unknown rotation, 
inferring in this way the directions of Alice's axes. The 
precision of this scheme is defined in a Bayesian way, by 
taking as cost function the transmission error, i.e. the 
distance between the directions of Alice's axes and Bob's 
axes at the end of the protocol. In terms of the estimated 
rotation g and the true one <?, the transmission error can 
be written as [|l5| 



e(g,g) = 6-2 X 1 (gg- 1 ) 



(45) 



where x 1 (.g) = Tr[C/^] is the character of the three dimen- 
sional irreducible representation of the rotation group. It 
is immediate to see that the transmission error is a cost 
function the form (|40|). 

What is the best precision that can be achieved with 
the mentioned protocol? To answer this question we need 
to solve two problems: the first is to find what is the opti- 
mal state for encoding rotations, and the second is to find 
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the optimal estimation strategy. It is important to stress 
that, since we want to achieve an absolute transmission, 
we are not allowed to use an external reference system, 
whose role would correspond to a partially shared refer- 
ence frame jlq]. For this reason we are allowed only to 
exploit the entanglement coming from the multiple equiv- 
alent representations that appear in the Clebsch-Gordan 
series of Uf N , where U g is the SU(2) matrix that repre- 
sent the rotation g in the two-dimensional Hilbert space 
TL of a single spin i particle. 

The tensor product Hilbert space TL® N can be decom- 
posed as in (| 



H 



= e 

i=o(|) 



Hj <8> C" 



(46) 



The irreducible representations are labeled by the quan- 
tum number j of the total angular momentum, which 
ranges from 0(|) to y for N being even (odd), respec- 
tively. The dimension of the representation space TLj is 



dj = 2j + 1 , 
while the multiplicities are given by p 



2j + l 



f +3 



1 



N 
f +3 



(47) 



(48) 



Since m,j > for any j < ^ , it is possible to have max- 
imal entanglement between representation spaces and 
multiplicity spaces for any j, with the only exception of 

3 = f • 

However, as shown in [ fEsf , the contribution of the sub- 
space with j = -j is negligible in the asymptotic limit 
of large TV. Therefore we can restrict ourself to the sub- 
— -l 

space TL = ffi? 7io ® C" 1 ^, and consider the state 
^=0(5) J 



\A)) 



e 

j=oa; 



(49) 



According to Theorem 0, and to the result of jT2) , this is 
the optimal state in the subspace TL' for the estimation 
of an unknown §U(2) rotation. 

Now we can use Theorem || to state that the optimal 
estimation strategy is described by the covariant POVM 
given by 5 = \r))(r)\ with 



\v)= VdM 

i=o(|) 



(50) 



The optimization of the coefficients Cj in the state ( [49] ) 
has been done in EH] , where the POVM (|5(i| ) was assumed 
by exploiting for simplicity the maximum likelihood ap- 
proach. In this way, the results of Theorem |l| and ^ 
provide the optimality proof for the protocol proposed 



in Ref. fL5[ , Therefore, we can definitely state that the 
asymptotic precision 



8tt2 
N 2 



(51) 



is the best that can be achieved for all input states and 
all POVM's, namely it is the ultimate precision limit im- 
posed by Quantum Mechanics in the absolute alignment 
of two Cartesian reference frames. 



B. Optimal estimation of a completely unknown 
maximally entangled state 

Maximally entangled states are a fundamental resource 
for quantum teleportation B and for quantum cryptog- 
raphy |(| . To achieve ideal teleportation, Alice and Bob 
must know with precision which maximally entangled 
state they are sharing, otherwise the fidelity of the state 
received by Bob with the original state from Alice can be 
lowered. Similar arguments apply to the cryptographic 
schemes where the correlations arising from entanglement 
are exploited to generate a secret key. 

Now we will consider the problem of estimating in 
the best way a completely unknown maximally entan- 
gled state, provided that N identical copies are available. 
This can be done as an application of Theorem ||. Let's 
consider a state S TL®TL, with dim(7Y) = d. In terms 
of the notation (pi) , this state is maximally entangled if 
and only if ip = —j= U, where U is some unitary operator. 

Using property (J3|), any maximally entangled state can 
be written as 



1 



\^ g )) = —{U g ®t) |1» 



(52) 



where U g is an element of the group §U(cZ). 

If N identical copies of the unknown state \tf> g } are 
given, then the problem becomes to find the best esti- 
mate for parameter g that labels the states of the form 
\^f g )) = \tp g ))® N . Optimality is defined here in terms of 
maximization of the Uhlmann fidelity between the true 
state and the estimated one: 

f(g,g) = \U g \iP g ))\ 2 . (53) 
Using the definition (|52|) and the property (||), we obtain 



f{g,g) 



1 

rf2 



\x{gg- l )t 



(54) 



where x(g) = Tr[C/ g ]. The maximization of the fidelity 
corresponds to the minimization of the cost function 



c(g,g) = 1- f(g,g) , 



(55) 



which is of the form ([40[). In particular, for d = 2, 
\x(g)\ 2 — l+xHff); where x 1 (ff) = Tr[t/g] is the character 
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of the irreducible representation of SU(2) with angular 
momentum j = 1 , whence we have 



(56) 



All the states of the form 



from the input state 
I*) 



\i/j g )}® N are generated 



1 



by the action of the representation 

R(G) = {(u g <g> i)® N I u g e SU(d)} . 



(57) 



(58) 



Now we need to know how the input state is decomposed 
with respect to the invariant subspaces of this represen- 
tation. 

Lemma 4 Using suitable bases for the muliplicity spaces 
in decomposition (Qj, the input state can be written 
as 

|*) = ® -|= , (59) 
pes V d p 

where the sum runs over the irreducible representations 
o/SU(d) occurring in the Clebsch-Gordan series o/R(G) 
and 



(60) 



d M and being respectively the dimension and the mul- 
tiplicity of the representation fi in the Clebsch-Gordan 
series of{Uf N }. 

Proof. See appendix. 

Thank to this lemma we can exploit directly the re- 
sult of Theorem to calculate the average fidelity. Now 
we will carry on the calculation of the optimal fidelity in 
the simplest case d = 2. As usual, the irreducible repre- 
sentations of SU(2) are labeled by the quantum number 
j, ranging from O(^) to y for N being even (odd), re- 
spectively. The minimum cost can be evaluated using 
Theorem ^ as 



On the other hand, the matrix Cy is calculated accord- 
ing to the definition ( fi^ ) , namely by evaluating the mul- 
tiplicity of the representation with angular momentum 
k = 1 in the Clebsch-Gordan series of the tensor product 



In this way we get 



(64) 



Since ^ 



1, we have 



■=o(i) 



■'j+i 



(65) 



To obtain the asymptotic behavior of the optimal fidelity, 
we can approximate the binomial distribution in (p2) 
with a Gaussian G a (x) with mean x — and variance 
a 2 = . Since the sum in ( p5| ) runs over a large interval 
with respect to a, we can also approximate it with an in- 
tegral over [0, +oo] . All these approximations hold up to 
order higher than j; ■ Thus the evaluation of the optimal 
fidelity is reduced to the evaluation of the integral 



I = 



dx g{x)g(x + 1) G a {x) 



(66) 



whose leading order can be obtained from Taylor expan- 
sion. In this way, we derive the asymptotic cost 



\Opt 



3 

In 



corresponding to the optimal fidelity 

3 

an ' 



(f)° pt - 1 



(67) 



(68) 



Remarkably, the Bayes cost with uniform a priori distri- 
bution has the same asymptotic behavior of the cost of 
the optimal locally unbiased estimator obtained in Jl4) , 
for any possible value g of the true parameter. This 
means that in the present unbiased case the covariant 
measurement of Theorem || is optimal not only on aver- 
age but also pointwise. 



(c) 



Opt _ 



y=o(i) 



G; 



(61) 



Using Eq. ( |6C| ) with the values of dimensions and mul- 
tiplicities given by ([!?]) and (^) , the coefficients of the 
state become 



where 



9® 



2i + l 



N 



(62) 



(63) 



+ 4+1 



V. CONCLUSIONS 

In this paper we solved the general problem of opti- 
mal estimation of group transformations in the Bayesian 
framework with uniform prior. For this purpose, we in- 
troduced a class of cost functions generalizing the Holevo 
class for phase estimation, containing the maximum like- 
lihood strategy as a special case. For this family of cost 
functions, we derived the general form of the optimal 
input states, which involves maximal entanglement be- 
tween representation and multiplicity spaces of the group 
action. More precisely, the form of an optimal input state 
is a direct sum of maximally entangled states, and for a 
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given cost function one only needs to optimize the coeffi- 
cients in the sum. Moreover, for any state of the optimal 
form all invariant cost functions lead to the same optimal 
POVM. In this way, it is possible to derive an explicit 
expression for the average cost and to reduce the opti- 
mization of the state to a simple eigenvalue problem. As 
applications of the general result we have given the first 
complete derivation of the ultimate precision limit im- 
posed by Quantum Mechanics in the absolute alignment 
of two Cartesian reference frames, and we have derived 
the optimal estimation of a completely unknown two- 
qubit maximally entangled state with N copies of the 
state. In the present paper we focused attention to com- 
pact groups and finite dimensional Hilbert spaces, never- 
theless an extension of our results to infinite dimensional 
Hilbert spaces and non-compact groups is possible, in the 
same way as in However, since in infinite dimension 
the optimal states may be non-normalizable, one has to 
approximate them with physical states by fixing addi- 
tional constraints as, typically, the energy constraint. 



Acknowledgments 

This work has been supported by the FET European 
Networks on Quantum Information and Communica- 
tion Contract IST-2000-29681:ATESIT, by MIUR 2003- 
Cofinanziamento, and by INFM PRA-2002-CLON. 



VI. APPENDIX 



A. Invariant cost functions 



where = Tr [[/£]. 

Proof. For each irreducible representation fj,, consider 
the matrix elements ufj(g) = (ip^\ Ujf \tpj) with respect 
to a fixed basis B M = ) | i = 1, . . . , d M } for the rep- 
resentation space Tin- Since the collection of all these 
matrix elements is an orthogonal basis for L 2 (G) [fl7|| , 
we can expand the function c(g, g) as 

<9, 9 ) = E E E «w <-(.9) <m , (7i) 

fi,u i,j=l k,l=l 

where the complex conjugate in u^*{g) is for later con- 
venience. Now, the function c is both left- and right- 
invariant, whence it coincides with its average c(g,g) = 
J d k f d h c(kgh : kgh). Using Proposition |l| and Eqs. (|^) 
and (H), we obtain 



c (9:9) = J ' dk J dh c(kgh, kgh) 



E E a m «W x 

fi,v i,j,k,l 

d„ 



i E E <!ti *i« Tr t K-i ] 

M M i,j,l,k 



E a n *?(9 l 9) , 



where a„ = Ei,j a ijij- 



In this section we prove the form ( |18| ) of any invariant 
cost function. 

Proposition 1 The following integral formula holds: 

JdgU^U^=5^ . (69 ) 

Proof. Using Eq. (||), we recognize in the l.h.s. the 
projection onto the subspace of Ti^ <S> H„ that carries the 
trivial representation in the Clebsch-Gordan decomposi- 
tion of U£ <g> Ug *. Using the orthogonality of charac- 
ters, one can prove that such tensor product contains the 
trivial representation if and only if /i = v. Moreover, if 
fi = then the multiplicity of the trivial representation 
is one. Using the property (||), we see that the vector 
is invariant under Ujf (g> Ujf * . Therefore the r.h.s. 
is the projection onto the one-dimensional invariant sub- 
space that carries the trivial representation, whence it 
coincides with the l.h.s. ■ 

Proposition 2 Any invariant function c(g,g) has the 
form 

c(ff, 5 ) = E "vxfig- 1 ® , (70) 



B. Decomposition of a product of maximally 
entangled states 

Here we give the proof for Lemma |[ 

Proof. Consider the representation R(G) = {(U g ® 
1) 0JV | U g E SV(d)}. It is convenient to order the 2N 
Hilbert spaces in the tensor product H® 2N in such a way 
that the unitary operators act on the first TV spaces, while 
the identity operators acts on the second N spaces. With 
this ordering, by defining Ha (Hb) the tensor product of 
the first (second) TV spaces, we have R(G) = Ra(G)(E)^b, 
where R A (G) = {Uf N | U g e §U(d)} is the TV-fold tensor 
representation of §U(rf). 

Let's decompose now the Hilbert space Ha with re- 
spect to the action of the representation R A (G): 

«a=0H,® C m » . (72) 

it* 

The tensor product Ha ®Hb can be decomposed with 
respect to R(G) = Ra(G) ® 1 B as 

Ha®H b =(§ H^® C M " , (73) 



11 



where the multiplicity has been increased to 



"<7i 

spaces. 



x d , since Tis has been absorbed in the multiplicity 



With respect to the factorization H® 2N = Ha 
the input state = \&))® N can be written as 



I*) 



1 



OF 



li 



(74) 



where 1® N is the identity in H A = H® N = H B - Here we 
are using notation (|l|), with respect to the product basis 
g®JV £ Qr -ft A an( j B being a fixed basis for H. Now 
we want to change the basis in Ha, by switching from 
B® N to B' = U M B£ ® where B£ = {|^> 



1. 



,<W (B 



1, . . . , is a basis for 

One 



the representation (multiplicity) space in Eq. (|7: 
has 



=w E E ic>aK)a ioio* 



'd N 



m—l n—1 



m/ I'm/ ' 



where we defined the normalized vector 



(75) 



Therefore, exploiting notation (Q) with respect to the 
bases {I?/ 1 ™)} and {\t{^}}, we can write 



(76) 




/i m— 1 
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